Continuous Speech Recognition without End-point Detection
نویسندگان
چکیده
منابع مشابه
Continuous speech recognition without end-point detection
A new continuous speech recognition method that does not need the explicit speech end-point detection is proposed. A one-pass decoding algorithm is modified to decode the input speech of infinite length so that, with appropriate nonspeech models for silence and ambient noises, continuous speech recognition can be executed without the explicit endpoint detection. The basic algorithm is 1) decode...
متن کاملDomain-Specific Utterance End-Point Detection for Speech Recognition
The task of automatically detecting the end of a device-directed user request is particularly challenging in case of switching short command and long free-form utterances. While lowlatency end-pointing configurations typically lead to good user experiences in the case of short requests, such as “play music”, it can be too aggressive in domains with longer free-form queries, where users tend to ...
متن کاملSpeech Recognition Front End Without Information Loss
Phoneme classification is investigated for linear feature domains with the aim of improving robustness to additive noise. In linear feature domains noise adaptation is exact, potentially leading to more accurate classification than representations involving non-linear processing and dimensionality reduction. A generative framework is developed for isolated phoneme classification using linear fe...
متن کاملContinuous Audio-visual Speech Recognition Continuous Audio-visual Speech Recognition
We address the problem of robust lip tracking, visual speech feature extraction, and sensor integration for audiovisual speech recognition applications. An appearance based model of the articulators, which represents linguistically important features, is learned from example images and is used to locate, track, and recover visual speech information. We tackle the problem of joint temporal model...
متن کاملContinuous speech recognition using phone-based anchor point detection and diphone-based dp-matching
This paper deals with acoustic-phonetic decoding for CSR. There are two different processing modules depending on the stea.dy or transient nature of the speech input. First the stea.dy state speech processing module, called phone-based anchor point detection, performs some preprocessing a.llowing the selection of only a subset of the vocabulary under consideration. Secondly a general processing...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEJ Transactions on Electronics, Information and Systems
سال: 2004
ISSN: 0385-4221,1348-8155
DOI: 10.1541/ieejeiss.124.1121